An Exploration of Neural Sequence-to-Sequence Architectures for Automatic Post-Editing
In this work, we explore multiple neural architectures adapted for the task
of automatic post-editing of machine translation output. We focus on neural
end-to-end models that combine both inputs, $mt$ (raw MT output) and $src$
(source language input), in a single neural architecture, modeling
$\{mt, src\} \rightarrow pe$ directly. Apart from that, we investigate the influence of
hard-attention models which seem to be well-suited for monolingual tasks, as
well as combinations of both ideas. We report results on data sets provided
during the WMT-2016 shared task on automatic post-editing and demonstrate
that dual-attention models that incorporate all available data in the APE
scenario in a single model improve on the best shared task system and on all
other published results after the shared task. Dual-attention models that are
combined with hard attention remain competitive despite applying fewer changes
to the input.
Comment: Accepted for presentation at IJCNLP 2017
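To make the dual-attention idea concrete, here is a minimal PyTorch sketch of a single decoder step that attends over two encoded inputs ($mt$ and $src$) and fuses the resulting context vectors; all module names and sizes are illustrative assumptions, not the authors' implementation.

```python
# Minimal sketch of a dual-attention decoder step (PyTorch). Names and
# sizes are illustrative assumptions, not the authors' implementation.
import torch
import torch.nn as nn
import torch.nn.functional as F

class DualAttention(nn.Module):
    """Attends over two encoders (mt and src) and fuses both contexts."""
    def __init__(self, hidden_size):
        super().__init__()
        self.query_mt = nn.Linear(hidden_size, hidden_size)
        self.query_src = nn.Linear(hidden_size, hidden_size)
        self.fuse = nn.Linear(2 * hidden_size, hidden_size)

    def attend(self, query, keys):
        # Dot-product attention; keys double as values for simplicity.
        scores = torch.bmm(query.unsqueeze(1), keys.transpose(1, 2))
        weights = F.softmax(scores, dim=-1)
        return torch.bmm(weights, keys).squeeze(1)  # (batch, hidden)

    def forward(self, state, mt_states, src_states):
        ctx_mt = self.attend(self.query_mt(state), mt_states)
        ctx_src = self.attend(self.query_src(state), src_states)
        # Concatenate both contexts and project back to the hidden size.
        return torch.tanh(self.fuse(torch.cat([ctx_mt, ctx_src], dim=-1)))

# Usage: one decoder step over toy encoder outputs.
h = 8
att = DualAttention(h)
state = torch.randn(2, h)          # decoder state, batch of 2
mt_states = torch.randn(2, 5, h)   # encoded raw MT output, length 5
src_states = torch.randn(2, 7, h)  # encoded source sentence, length 7
print(att(state, mt_states, src_states).shape)  # torch.Size([2, 8])
```

A hard-attention variant, as investigated in the paper, would replace the soft softmax weights with a (near) one-hot selection over positions of the raw MT output.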
Near Human-Level Performance in Grammatical Error Correction with Hybrid Machine Translation
We combine two of the most popular approaches to automated Grammatical Error
Correction (GEC): GEC based on Statistical Machine Translation (SMT) and GEC
based on Neural Machine Translation (NMT). The hybrid system achieves new
state-of-the-art results on the CoNLL-2014 and JFLEG benchmarks. This GEC
system preserves the accuracy of SMT output and, at the same time, generates
more fluent sentences, as is typical for NMT. Our analysis shows that the
created systems are closer to reaching human-level performance than any other
GEC system reported so far.
Comment: Accepted for oral presentation, research track, short papers, at NAACL 2018
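One simple way to realize such a hybrid is to pipeline the two components: the accuracy-oriented SMT system produces a first-pass correction, which the fluency-oriented NMT system then rewrites. The sketch below illustrates that idea only; `smt_correct` and `nmt_correct` are hypothetical stand-ins, and the paper's actual combination may differ.

```python
# Hedged sketch of a pipelined SMT+NMT hybrid for GEC: the SMT-based
# corrector runs first (accuracy-oriented), and its output is rewritten
# by the NMT-based corrector (fluency-oriented). Both components here
# are hypothetical stand-ins; the paper's combination may differ.
def hybrid_gec(sentence, smt_correct, nmt_correct):
    draft = smt_correct(sentence)  # first pass: precise, conservative edits
    return nmt_correct(draft)      # second pass: fluency-oriented rewriting

# Toy usage with trivial stand-in components.
print(hybrid_gec("He go to school .",
                 smt_correct=lambda s: s.replace("go ", "goes "),
                 nmt_correct=lambda s: s))
```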
Log-linear Combinations of Monolingual and Bilingual Neural Machine Translation Models for Automatic Post-Editing
This paper describes the submission of the AMU (Adam Mickiewicz University)
team to the Automatic Post-Editing (APE) task of WMT 2016. We explore the
application of neural translation models to the APE problem and achieve good
results by treating different models as components in a log-linear model,
allowing for multiple inputs (the MT-output and the source) that are decoded to
the same target language (post-edited translations). A simple string-matching
penalty integrated within the log-linear model is used to control for higher
faithfulness with regard to the raw machine translation output. To overcome the
problem of too little training data, we generate large amounts of artificial
data. Our submission improves over the uncorrected baseline on the unseen test
set by -3.2% TER and +5.5% BLEU and outperforms any other system submitted to
the shared task by a large margin.
Comment: Submission to the WMT 2016 shared task on Automatic Post-Editing
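The underlying scoring scheme is a weighted sum of component-model log-probabilities minus a faithfulness penalty. The sketch below assumes per-model log-probabilities are already computed and uses an illustrative penalty (counting hypothesis words absent from the raw MT output); the weights and the exact penalty definition are assumptions, not the AMU settings.

```python
# Hedged sketch of log-linear scoring with a string-matching penalty.
# The penalty definition and weights are illustrative assumptions.
import math

def string_match_penalty(hypothesis, mt_output):
    """Count hypothesis words that do not appear in the raw MT output."""
    mt_words = set(mt_output.split())
    return sum(1 for w in hypothesis.split() if w not in mt_words)

def log_linear_score(log_probs, weights, hypothesis, mt_output, penalty_weight):
    # Weighted sum of model log-probabilities minus the faithfulness penalty.
    score = sum(w * lp for w, lp in zip(weights, log_probs))
    return score - penalty_weight * string_match_penalty(hypothesis, mt_output)

# Toy example: a monolingual (mt -> pe) and a bilingual (src -> pe) model.
log_probs = [math.log(0.4), math.log(0.2)]
print(log_linear_score(log_probs, [0.6, 0.4], "das Haus ist gross",
                       "das Haus ist gros", penalty_weight=0.5))
```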
MS-UEdin Submission to the WMT2018 APE Shared Task: Dual-Source Transformer for Automatic Post-Editing
This paper describes the Microsoft and University of Edinburgh submission to
the Automatic Post-editing shared task at WMT2018. Based on training data and
systems from the WMT2017 shared task, we re-implement our own models from the
last shared task and introduce improvements based on extensive parameter
sharing. Next we experiment with our implementation of dual-source transformer
models and data selection for the IT domain. Our submission decisively wins
the SMT post-editing sub-task, establishing a new state of the art, and is a
very close second (or equal, 16.46 vs. 16.50 TER) in the NMT sub-task. Based on
the rather weak results in the NMT sub-task, we hypothesize that
neural-on-neural APE might not actually be useful.
Comment: Winning submissions for the WMT2018 APE shared task
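A dual-source transformer decoder layer can be pictured as a standard decoder layer with a second cross-attention block, one per encoder. The PyTorch sketch below assumes a serial ordering of the two cross-attention blocks and omits the causal mask for brevity; it illustrates the general idea, not the submission's exact architecture.

```python
# Illustrative sketch of a dual-source transformer decoder layer (PyTorch):
# self-attention followed by two cross-attention blocks, one per encoder
# (src and mt). The serial ordering, sizes, and omission of the causal
# mask are simplifying assumptions, not the submission's exact design.
import torch
import torch.nn as nn

class DualSourceDecoderLayer(nn.Module):
    def __init__(self, d_model=64, nhead=4):
        super().__init__()
        self.self_attn = nn.MultiheadAttention(d_model, nhead, batch_first=True)
        self.src_attn = nn.MultiheadAttention(d_model, nhead, batch_first=True)
        self.mt_attn = nn.MultiheadAttention(d_model, nhead, batch_first=True)
        self.ff = nn.Sequential(nn.Linear(d_model, 4 * d_model), nn.ReLU(),
                                nn.Linear(4 * d_model, d_model))
        self.norms = nn.ModuleList([nn.LayerNorm(d_model) for _ in range(4)])

    def forward(self, tgt, src_mem, mt_mem):
        # Each sub-layer uses a residual connection and layer normalization.
        tgt = self.norms[0](tgt + self.self_attn(tgt, tgt, tgt)[0])
        tgt = self.norms[1](tgt + self.src_attn(tgt, src_mem, src_mem)[0])
        tgt = self.norms[2](tgt + self.mt_attn(tgt, mt_mem, mt_mem)[0])
        return self.norms[3](tgt + self.ff(tgt))

# Usage over toy tensors.
layer = DualSourceDecoderLayer()
tgt = torch.randn(2, 6, 64)      # post-edit hypothesis so far
src_mem = torch.randn(2, 9, 64)  # encoded source sentence
mt_mem = torch.randn(2, 7, 64)   # encoded raw MT output
print(layer(tgt, src_mem, mt_mem).shape)  # torch.Size([2, 6, 64])
```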